Words containing a comma (,):

Rank in Wordlist | Frequency | Word |
---|---|---|
9357 | 30 | 1,5 |
10212 | 27 | 17,00 |
11532 | 23 | 16,00 |
11957 | 22 | 18,00 |
11958 | 22 | 19,00 |
11960 | 22 | 20,00 |
12893 | 20 | 1,2 |
14043 | 18 | 21,00 |
14651 | 17 | 11,00 |
14655 | 17 | 3,5 |

Words containing a percent sign (%):

Rank in Wordlist | Frequency | Word |
---|---|---|
11956 | 22 | 10% |
12423 | 21 | 80% |
12897 | 20 | 20% |
13445 | 19 | 40% |
13446 | 19 | 60% |
14658 | 17 | 70% |
16096 | 15 | 50% |
16943 | 14 | 90% |
17862 | 13 | 30% |
21419 | 10 | 5% |

Words containing a double quote ("):

Rank in Wordlist | Frequency | Word |
---|---|---|
242 | 880 | ." |

Words containing an apostrophe ('):

Rank in Wordlist | Frequency | Word |
---|---|---|
46078 | 3 | 'الدولة |
57806 | 2 | ''الدولة |
60195 | 2 | الإسلامية' |
80730 | 1 | ''رافال'' |
80731 | 1 | ''عكاشة |
80732 | 1 | ''يحيى |
80733 | 1 | 'داعش' |
80734 | 1 | 'ذي |
80735 | 1 | 'شيخ |
80736 | 1 | 'مظاهرات |

Words containing a plus sign (+):

Rank in Wordlist | Frequency | Word |
---|---|---|
22947 | 9 | 5+1 |
22950 | 9 | 90+2 |
33818 | 5 | 90+4 |
38857 | 4 | 90+3 |
46243 | 3 | 90+1 |
74184 | 2 | مجموعة 5+1 |
80868 | 1 | 1+90 |
81440 | 1 | 2+90 |
82072 | 1 | 45+1 |
82073 | 1 | 45+4 |

Words containing a slash (/):

Rank in Wordlist | Frequency | Word |
---|---|---|
457 | 536 | حزيران/يونيو |
469 | 528 | تموز/يوليو |
508 | 505 | الثاني/يناير |
549 | 476 | شباط/فبراير |
628 | 429 | الثاني/نوفمبر |
714 | 388 | آذار/مارس |
1316 | 237 | أيار/مايو |
1327 | 235 | أيلول/سبتمبر |
1349 | 231 | ايلول/سبتمبر |
1357 | 230 | اذار/مارس |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters occur with more than a very low frequency, this may indicate a problem in preprocessing.
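
The check described above can be sketched in a few lines of Python. This is only an illustration, not the procedure used to produce this report: the function name `suspicious_words`, the default threshold `min_freq=10`, and the plain word in the sample list are assumptions, since the text does not say how low "very low frequency" is.

```python
# Special characters listed in the text above.
FORBIDDEN_CHARS = set(",()%&$\"'+*=/_")


def suspicious_words(wordlist, forbidden=FORBIDDEN_CHARS, min_freq=10):
    """Return (word, freq, offending_chars) for words that contain a
    forbidden character and are NOT rare, most frequent first.

    `min_freq` is an assumed cut-off; choose it per corpus and language.
    """
    hits = []
    for word, freq in wordlist:
        offending = sorted(set(word) & set(forbidden))
        if offending and freq >= min_freq:
            hits.append((word, freq, offending))
    return sorted(hits, key=lambda hit: -hit[1])


# A few entries from the tables in this section, plus one plain word
# with a made-up frequency for contrast.
SAMPLE = [
    ("حزيران/يونيو", 536),
    ("10%", 22),
    ("'داعش'", 1),
    ("كتاب", 900),  # hypothetical entry, no special character
]
```

Calling `suspicious_words(SAMPLE)` keeps the slash and percent entries and drops the rare quoted word. Passing a smaller `forbidden` set is one way to model a language in which some of these characters are legitimate inside words.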
Words containing %:
SELECT w_id - 100, freq, word FROM words WHERE w_id > 100 AND word LIKE "%\%%" LIMIT 10;
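
To run the same kind of query for every character in the list, the pattern has to be built with care, because % and _ are wildcards in LIKE. The following is a minimal sketch using Python's `sqlite3` module, not the database setup behind this report. The helper names, the in-memory table, and the `ORDER BY w_id` clause are assumptions. The offset of 100 is taken from the query above, which suggests that the first 100 ids are reserved and that `w_id - 100` is the rank in the wordlist.

```python
import sqlite3

# Special characters listed in the text above.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]
RESERVED_IDS = 100  # assumption taken from the query: rank = w_id - 100


def like_pattern(char):
    """Build a LIKE pattern matching words that contain `char` literally."""
    escaped = char.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    return "%" + escaped + "%"


def top_words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows for words containing `char`."""
    sql = (
        "SELECT w_id - ?, freq, word FROM words "
        "WHERE w_id > ? AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"  # assumes w_id follows wordlist rank
    )
    params = (RESERVED_IDS, RESERVED_IDS, like_pattern(char), limit)
    return conn.execute(sql, params).fetchall()


def demo_connection():
    """In-memory table holding a few rows from the tables in this section."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
    rows = [
        (9457, "1,5", 30),
        (12056, "10%", 22),
        (12523, "80%", 21),
        (23047, "5+1", 9),
        (90000, "a_b", 1),  # hypothetical row to exercise the "_" escape
    ]
    conn.executemany("INSERT INTO words (w_id, word, freq) VALUES (?, ?, ?)", rows)
    return conn
```

Looping over `SPECIAL_CHARS` and calling `top_words_containing(conn, char)` for each one gives one result list per character. Without the escaping step, the pattern for `_` would match every word of at least one character, so that check would be meaningless.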
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots